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■ For a given ensemble of N independent and identically prepared particles, 
we calculate the binary decision costs of different strategies for measurement 
of polarised spin 1/2 particles. The result proves that, for any given values of 
the prior probabilities and any number of constituent particles, the cost for a 
combined measurement is always less than or equal to that for any combina- 
tion of separate measurements upon sub-ensembles. The Bayes cost, which is 

o 
t> 
o 

■ obtained in a simple closed form. 
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that associated with the optimal strategy (i.e., a combined measurement) is 
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In a problem of experimental design, the task of the experimentalist is to find an optimal 
observational strategy. Ordinarily, one must choose among different strategies before the 



data can be obtained, and hence one must perform a preposterior analysis. When the 
experiment involves a decision among different quantum mechanical states, such an analysis 
is indeed important, since, unlike the classical case, virtual sampling, i.e., repeated samplings 
of the same system, are not generally permitted. 

There are a number of different approaches for finding an optimal strategy. In the 
information-theoretic approach, one typically determines the strategy that maximises the 
mutual information (see, e.g., |l|]), but this is generally difficult, owing to the nonlinear 
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nature of the Shannon information. In the minimax approach |2| , one finds the strategy that 
minimises the maximum cost (or loss) incurred by the decision among different strategies. 
When certain a priori knowledge concerning the nature of the state is available, then one 
may seek a strategy that minimises the expected cost, using a Bayes procedure [§]> 0- 

In the present Letter, we study the Bayesian approach to a binary decision problem for an 
ensemble of polarised spin 1/2 particles. First, we briefly introduce the Bayesian approach 
to quantum hypothesis testing. These notions, developed by Helstrom and others ( [|J, 
H, and U), are then applied to obtain the optimal strategy for a Bayes decision between 
two quantum mechanical pure states, for an ensemble of polarised spin 1/2 particles. In this 
example, we first study the application of quantum Bayes sequential analysis to the ensemble. 
The result is then compared with a combined measurement of the entire ensemble, treated 
as a single composite system. Other strategies consisting of combined measurements of 
sub-ensembles are also considered. The Bayes solution to the problem demonstrates that 
the Bayes cost for separate sequential measurements of the individual particles is the same 
as that of a combined measurement. This result differs from that predicted by Peres and 
Wootters [0. Any other strategy turns out to entail a higher expected cost. Nevertheless, we 
conclude, for the reasons given below, that a combined measurement of the entire ensemble 
is, in general, an optimal one. 

First, consider a decision problem requiring a choice among M hypotheses Hi, ■ ■ ■ , Hm 
concerning a quantum system. Hypothesis asserts that the density operator of the system 
is pk-, (k = 1, ■ ■ ■ , M), and the prior probability of the j-th state is with 

M 

E& = i- (i) 

k=l 

From past experience, one knows that the system is in the j-th state with a relative frequency 
£j. The self-adjoint operators p fc act on the vectors of a Hilbert space TC, are non-negative 
definite, and have unit trace. 

A quantum decision strategy is characterised by a probability operator measure (pom) on 
7i, i.e., a set of M non-negative definite self-adjoint operators IL,- satisfying 
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M 

= 1. (2) 

If this pom is applied to the system when hypothesis is true, then the conditional 
probability of choosing hypothesis Hj is given by 

Pr(X = j\W = k) = TrfalLj) . (3) 

Here, X denotes the random variable that is to be observed, and W, typically being the 
parameter, is the unknown state of nature. 

Now, let Cij be the cost of choosing hypothesis Hi when Hj is true. Then the expected 
cost of the observational strategy specified by the pom {Hj} is || 

M M M 

C = EE^'TrGW = TrE^, (4) 

i=l j=l i 

where the Hermitian risk operators Ri are defined by 

M 

Ri = E s/Vj • ( 5 ) 

A set {Ilj} of pom that minimises the cost (4), under the constraints (2), is defined as 
optimal and the cost is Bayes, i.e., C = C* (the suprescript * here corresponds to the optimal 
strategy). Necessary and sufficient conditions for the optimality of a pom are known to be 
0, the self-adjointness of the operator 

M M 

T = Y,R&i = E n ^ (6) 

i=i i=i 

and the non- negative definiteness of the operator Rj — T for all j — 1, • • • , M. The minimum 
expected Bayes cost is thus 

(7*(£,{n*}) = TrT. (7) 

In a simple case where M — 2, i.e., for binary decisions, one can easily verify |4[] that the 
optimal pom is projection valued, and the Bayes cost becomes 

c*(un*}) = £iCii + 6Ci 2 

-6(Ci 2 - C 22 ) E Vi , (8) 

Vi>0 



where rji are the eigenvalues of the operator p 2 — 7Pi, with 

7 = ^° 21 ~ = _JL . ( 9 ) 

^2{C\2 — C22) 1 — £ 

Here and in the sequel, we choose a 0-1 cost structure; CV,- = 1 — 5ij, i.e., assign cost 1 to an 
incorrect decision and to a correct decision. Also, the prior probability for state 1 is given 
by £1 = f , and hence £ 2 = 1 - f . 

Now, we consider an experiment where a physicist must estimate (decide) the direction 
of polarisation of a given ensemble of N spin 1/2 particles, using a Stern-Gerlach (s-g) 
device. The physicist knows that the particles have been filtered through another s-g device 
with a magnetic field in the x — y plane at a constant angle 9\ or 9 2 from the x-axis, and 
in either case the spin up state has been selected. The physicist can select the orientation 
angle of the detector relative to the x-axis. When the particle passes through the field 
of the detector magnet, the physicist observes either the spin up (head) or spin down (tail) 
state, whereupon he must decide between the alternatives 9\ (i.e., the polarisation direction 
9 = 9i) and 9 2 . We do not specify the values of the angles {9 k }, but the difference between 
the two angles is given by \9 2 — 9±\ — 25. 

First, consider the case where the physicist performs sequential observations of each 
individual spin 1/2 particle. Suppose, for simplicity, that N — 1. The physicist has to 
decide, either before or after the observation, whether the particle is polarised in the 9\ or 
#2 direction. If a decision were to be chosen without any observation, then a Bayes decision 
against the prior distribution of W (in this case, W = 1 or 2) would be optimal. 

Suppose that X (spin 'up' or 'down') is observed before a decision is chosen. Then, the 
decision process for the physicist follows the same procedure as the previous case. However, 
the difference here is that the distribution of W has changed from the prior to the posterior 
distribution. Hence, a Bayes decision against the posterior distribution of W is now optimal. 

The conditional probability for observing the spin up (+1) state, when the state of the 
system is pk, is given by 

b k (<f>) = Pr(X = +1\W = 9 k ) = CO s 2 ( 9 -^). (10) 
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If one fixes the angle 0, then the experiment is entirely analogous to a classical coin tossing 
problem ||, with coins whose bias is given by the above bf.. However, having the freedom 
to choose the angle for each value of the prior £, the physicist must choose an optimal 
direction given by || 

, ,,s , -i /6siri0i-(l-Osin0 2 \ 

(poptiO = tan — . 11 

y£ COSt^i — (1 — 4) cos 2 J 

Hence, we have a problem of tossing quantum coins whose bias is a function of the prior 
probability £. 

Having chosen the optimal angle op t, the Bayes decision rule specifies that 6\ is to be 
chosen if the spin up state is observed, and 8 2 otherwise. The Bayes cost against the prior 
£, when N = 1, can easily be obtained by calculating the eigenvalues of pi — jpi, with the 
result || 
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C*(t,l) = -(l-V2e-(2 + cos2^ + lj . (12) 

Now, suppose that N = 2, and the result of measurement of the first particle has been 
obtained. As mentioned above, the physicist must follow the same procedures as in the case 
N = 1, with the posterior distribution £(±) instead of the prior £. From Bayes' theorem, 
the posterior probability that 9 — 9\ is given by 

£(+) = bl ^'^ (13) 

iK ' ' &i(0)-£ + & 2 (0) •(!-£) 1 ) 



or 



{H = d-MW-t (14) 

according to the outcome (+ or — ) of the first measurement. The optimal orientation 
angle, before performing the second measurement, is now given by </> OJ rf (£(+)) or (fi op t(£,(— )), 
accordingly. The Bayes cost for this case (N = 2) is given by the weighted average, i.e., 

c* = 6 1 ec*(e(+),i)+6 2 (i-o<5*(e(-),i) ■ 
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Next, we consider an arbitrary number N of particles. Again, the procedures are the same 
as above, except that the prior is now replaced by one of the 2 N ~ 1 posteriors [£(+ + •••+), 
• ■ •] , after observations of N — 1 particles. In a classical Bayes decision procedure [0 , [[| , it is 
difficult (or impossible) to obtain the Bayes cost as a closed function of N. The reason is that, 
first, one must study the tree jnj of the posterior distributions, with branches proliferating 
as ~ 2 N . To each branch (i.e., posterior) of the tree, one associates the cost C*(-,l), and 
then calculates the weight (probability) for the sequence of outcomes associated with that 
branch. After these considerations, one can, in principle, obtain the weighted average of 
the cost, which involves 2 N ~ X terms. (Note that, for classical coins, the branches of the 
posterior tree do recombine and hence proliferate as ~ N. However, the weights associated 
with the branches do not recombine, and therefore one cannot avoid the consideration of 
2 1 terms.) 

In the case of our "quantum coins" , the situation appears even worse, since, after each 
observation, the physicist must turn the device in accordance with formula (11). This results 
in changing the bias bk(4>) of the "coins" at each stage, and hence one must also incorporate 
the bias tree (which proliferates ~ 2^). However, it turns out that this optimal orientation 
forces the posterior tree to recombine into two branches, i.e., 

£(n,±) = ~(l± yjl- 4f(l-0 cos 2 ^- 1 ) 5) , (15) 

where ± corresponds to the outcome of the last (n — 1-th) trial being spin up (+) or down 
(— ). This result can be proven by induction as follows. First, for n = 1, it is easily verified 
that £(1, ±) = £(±) as given in (13) and (14). Next, assume that the last (n — 1-th) outcome 
of the trial is (— ), and that the posterior is given by the above £(n, — ). Then, if the next 
trial outcome is (+), follows from Bayes' theorem, that the posterior distribution, after n+1 
observations, is given by 

f( ,x W)-g(w,~) 

^ J •£(",-) + W) • (i - -)) ' 

with = <f) opt (£(n, — )). After some algebra, one can show that the above £(• • • — h) = 
£(71+ 1, +). The other three cases [£(• • ), etc.] can also be treated in the same manner. 
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Although the weights for different branches neither recombine in the quantum case, since 
(7*(£(ra, +), f) = C*(£(n, — ), f), the final average cost is just C*(£(n, ±), f ) times the sum of 
all the different weights (which is just 1), and hence we finally deduce that the Bayes cost 
for sequential observations is 

C*(t,N) = (7*(£(7V-1,±),1) 

= \ (l - v/l - 4£(f - cos 2 ^ ) , (16) 

for either value of the N — 1-th outcome (+ or — ). 

Next, consider the case where the physicist treats the entire ensemble as a single com- 
posite system. The total spin of a system with N particles is just N/2, and the density 
operator for a spin N/2 particle polarised in the direction n = (cos#, sin 9, 0) is given by 

W))mn = l~ N ^NC m N C n e~^ e , (17) 

where (n,m) = 0, • • • , N. According to the result in (8), one must find the eigenvalues of the 
matrix p 2 — 7Pi in order to obtain the Bayes cost. We first show that the matrix p 2 — 7Pi is 
of rank two, and thus has only two non-zero eigenvalues. Define two vectors u = {u n } and 
v = {v n } by 

u n = 2-^ 2 V^a> 01 , (18) 

and 

v n = 2^ 2 V^e^ 2 . (19) 

Then, (pi) mn = u* m u n and (/S 2 ) m „ = v* m v n . Since the inner product u • u* = v • v* = 1, one 
obtains 

PlU* = ^2(pl)mnU* n = U* 

n 

and similarly, p 2 v* = v*. Now, let w and A be an eigenvector and the corresponding 
eigenvalue of the matrix p 2 — 7Pi, i.e., 
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(pi - 7P 2 )w = Aw . (20) 

We may expand the eigenvector w in terms of a basis that contains either u* or v*, i.e., 

w = ciu* + or w = c 2 v* + v* ± . Here, denotes some vector orthogonal to u*, and 
similarly for v]_. However, since piu^_ = p 2 v]_ = 0, we have 

Aw = ciu* — 7c 2 v* . (21) 

Therefore, the matrix p 2 — 7 pi is of rank two, as claimed. On the other hand, if we form 
the inner product of the two vectors w = ciu* + and u, we obtain 

w u = Cl = y-^c 2 (v*-u) , (22) 



and similarly, 



wv = c 2 = ^-(u* • v) - ^c 2 . (23) 
A A 



Without any loss of generality, we may now set c± — 1, and then by eliminating c 2 from the 
above equations, we obtain the eigenvalues of the matrix p 2 — 7pi, i. e., 

A ± = i{(l-7)±V / ( 1 -7) 2 -4 7 (A 2 -l)} , (24) 

where 



A = (v*-u)(u*-v) 



N 2 



cos 27V (5) . (25) 



— 0~N \ " n „2m 

— 1 N^mZ 

m=0 

Therefore, the binary Bayes decision cost for a spin N/2 particle is 

C*(£,N) = y/l- 4£(l-0 cos 2 ^ <f) . (26) 

One immediately observes that the above cost (26) is the same as that obtained from sequen- 
tial analysis, given by (16). Hence, the Bayes solution to our optimisation problem states 
that a combined measurement is as advantageous as sequential measurements. These two 



8 



strategies, however, are not the only ones, and many other partially combined measurement 
procedures are possible. However, in the present formalism of sequential analysis, the only 
effect of any intermediate measurements, either partially combined or not, consists in updat- 
ing the posterior distributions. Since the Bayes cost is a monotonically decreasing function 
of the number of updating steps, this implies that any partially combined measurements will 
increase the cost. Therefore, we may now conclude that the optimal measurement strategy 
consists in either performing a combined measurement of the entire ensemble or perform- 
ing sequential measurements of the individual particles. Any other strategies will result in 
higher costs. 

This result is quite different from that expected by Peres and Wootters, who conjectured 
that sequential measurements can never be as efficient as a combined measurement 
However, it is important to note that their conjecture is based upon an information-theoretic 
approach, and the solution of an optimisation problem using a Bayesian approach can yield 



a different result. Massar and Popescu ||11|| , on the other hand, have proved the above 



mentioned conjecture explicitly for the case N = 2. The method used therein is effectively 
similar to a Bayesian approach, without the use of the prior distributions. However, when 
a prior distribution is available, the Bayes solution is known to be optimal in general ||. 
If prior knowledge is not available, one can still apply the Bayesian approach, using a non- 
informative prior. The analysis of such cases is, however, beyond the scope of the present 
Letter. 

Throughout the present Letter, we have only considered the cost associated with making 
decisions. In any practical situation, on the other hand, one must take into consideration 
other costs (e.g., the observational cost, the cost of analysing the results, etc.). In our exam- 
ple of sequential analysis, for example, at each stage before performing an observation, the 
physicist must analyse the previous results in order to determine the optimal turning angle. 
One might argue that || the analysing cost can be ignored, since, after all, scientists are so 
underpaid that the cost of their labors is usually negligible! Nonetheless, the observational 
costs cannot be ignored in general. Assuming the linearity of the utility function (e.g., that 



the total cost is just the sum of the decision cost and the observational costs), it is clear 
that any separate measurements will result in a higher total cost, since the decision cost for 
optimal sequential measurements (i.e., sequential measurements with optimal angular orien- 
tations) can never be lower than that for a combined measurement. Therefore, we conclude, 
after these considerations, that a combined measurement is optimal in general. 

In connection with the decision problem for classical coins which was briefly mentioned 
above, it is interesting to note that all the quantum results obtained by calculating the 
eigenvalues of the density operators can, in principle, be recovered from purely classical 
calculations, even for sequential measurements, if and only if the spins of the particles 
concerned are 1/2. That is, provided one does not perform any combined measurements, 
the results can be obtained from classical calculations. More details of this, as well as 
a treatment including the observational costs, may be found in ||. (See, also || for a 
comparison between classical and quantum coin tossings.) 

The authors acknowledge their gratitude to J. T. Key, and J. D. Malley for useful dis- 
cussions of the foregoing topics. 
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